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\ Abstract: We define a class of growing networks in whicli new nodes are 

given a spatial position and are connected to existing nodes with a probability 
mechanism favouring short distances and high degrees. The competition of 
preferential attachment and spatial clustering gives this model a range of 
■ interesting properties. Empirical degree distributions converge to a limit 

p I ■ law, which can be a power law with any exponent r > 2. The average 

(-H . clustering coefficient of the networks converges to a positive limit. Finally, 

a phase transition occurs in the global clustering coefficients and empirical 
distribution of edge lengths when the power-law exponent crosses the critical 
value T = 3. Our main tool in the proof of these results is a general weak law 
of large numbers in the spirit of Penrose and Yukich. 
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1. Introduction 



Many of the phenomena in the complex world in which we live have a rough description 
^ ■ as a large network of interacting components. It is therefore a fundamental problem to 

^ ■ derive the global structure of such networks from basic local principles. A well established 

principle is the preferential attachment paradigm which suggests that networks are built by 
adding nodes and links successively, in such a way that new nodes prefer to be connected to 
existing nodes if they have a high degree The preferential attachment paradigm offers, 
for example, a credible explanation of the observation that many real networks have degree 
distributions following a power law behaviour. On the global scale preferential attachment 
networks are robust under random attack if the power law exponent is sufficiently small, and 
have logarithmic or doubly logarithmic diameters depending on the power law exponent. 
These features, together with a reasonable degree of mathematical tractability, have all 
contributed to the enormous popularity of these models. 

Among the many criticisms directed at preferential attachment models is a significant 
deviation of their local structure from that observed in real networks. In preferential attach- 
ment models the neighbourhoods of typical nodes have a tree-like topology [9l H] , which 
is a crucial feature for their global analysis, but is not in line with the behaviour of many 
real world networks. The most popular quantity to measure the local clustering of networks 
are the clustering coefficients, which are measured to be positive in most real networks, but 
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which invariably vanish in preferential attachment models that do not incorporate further 
effects A possible reason for the clustering of real networks is the presence of a hidden 

variable assigned to the nodes, such that similarity of values is a further incentive to form 
links. Several authors have therefore proposed models combining preferential attachment 
with spatial features in order to address the weaknesses of pure preferential attachment. 
Among the mathematically sound attempts in this direction are the papers of Flaxman et 
al. dnilTT], Jordan [13], Jordan and Wade [15], Aiello et al. [1] and Cooper et al. [6]. These 
papers show that combining preferential attachment and spatial dependence can retain the 
global power law behaviour while changing the local topology of the network, for example by 
showing that the resulting graphs have small separators [lOl [11] . But none of them discusses 
clustering systematically by analysing the clustering coefficients. 

In this paper we propose a natural model of a network in which the preferential attachment 
paradigm is modulated by spatial proximity. Our model is a generalisation and variant of 
the one introduced in Aiello et al. [T]. The model is best described as a growing network in 
continuous time. New nodes are born according to a Poisson process of rate one and placed 
uniformly on the one-dimensional torus of length one. A node born at time t is connected by 
an ordered edge to each existing node independently with a probability ip{tp/ f{d)) where d 
is the indegree of the older node at time t, and p is the distance of the nodes. The decreasing 
profile function ip: [0, oo) — )■ [0,1] and increasing attachment rule /: NU {0} — )■ (0,oo) 
are the parameters of the model. We show that this modification of the original idea of 
preferential attachment preserves the power law behaviour of existing preferential attachment 
models while significantly changing the local topology leading to a positive average clustering 
coefficient. We also observe interesting phase transitions in the behaviour of the global 
clustering coefficient and the empirical edge length distribution. 

Our analysis of this model is using methods developed originally for the study of random 
geometric graphs, see Penrose and Yukich |17j for a seminal paper in this area and [16] for an 
exhibition. This approach is new in the context of preferential attachment and quite different 
from the established route to study dynamical random graph models, which is based on the 
use of differential equations to study the evolution of expected quantities and concentration 
inequalities to relate them to the empirical quantities. By contrast, our analysis is based on 
a rescaling which transforms the growth in time into a growth in space. This transformation 
stabilizes the neighbourhoods of a typical vertex and allows us to observe convergence of 
the local neighbourhoods of typical vertices in the graph to an infinite graph. This infinite 
graph, which is not a tree, is locally finite and can be described by means of a Poisson point 
process. We establish a weak law of large numbers, similar to the one given in [T7j, which 
allows us to deduce convergence results for a large class of functionals of the graph. Some 
further work is required to show that certain rare effects, like vertices having a very high 
degree or being linked to distant vertices, do not affect our functionals. 

The paper is organized as follows. In Section 2 we present the model. The main results 
concerning the degree distribution, the clustering coefficients and the edge length distribu- 
tion, are stated in Section 3. In Section 4 we describe the general method and main tools 
developed for the study of the network. Section 5 completes the proofs of our main results 
and, finally. Section 6 briefly discusses some variants and further developments. 
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2. The model 

Write Ti for the one-dimensional torus of length 1 represented as M/Z endowed with the 
usual distance. Let X denote a Poisson point process of unit intensity on Ti x (0,oo). A 
point X = (x, s) in Af is a vertex x, born at time s and placed at position x. We say that 
(x, s) is older than (y, t) if s < t. An edge is always oriented from the younger to the older 
vertex. Observe that, almost surely, two points of X neither have the same birth time nor 
the same position. For t > 0, write Xt for Af n (Ti x [0, t)), the set of vertices already born at 
time t. We construct a growing graph (Gt)t>o, starting from the empty graph, and adding 
successively the vertices in X when they are born (so that the vertex set of Gt is Xt), and 
connecting them to some of the older vertices. The rule is as follows: 

Construction rule: Given the network Gt- and y = (y, t) G X , we add the 
vertex y and link it to each older vertex x, independently, and independently 
of X, with probability 

Here the following definitions and conventions apply: 

(1) (i(x,y) denotes the length of the edge (y,x), which is the usual distance in Ti (for 
which, by a minor abuse of notation we also use the notation d) between the spatial 
positions of the vertices x and y. 

(2) if. [0, oo) — )• [0, 1] is the profile function. It is supposed to be non-increasing, and of 
total integral 1/2. Informally, it describes the spatial dependence of the probability 
that the newborn vertex y is linked to the existing vertex x. 

(3) Zx(t— ) (resp. Zy!^(t)) denotes the indegree of vertex x at time t— (resp. t), that is, 
the total number of incoming edges for the vertex x in Gt- (resp. Gt)- Similarly, we 
denote by Yy the outdegree of vertex y, which remains the same at all times u > t. 

(4) /: NU {0} — )• (0,oo) is the attachment rule. It is supposed to be non-decreasing. 
Informally, it quantifies the preferential "strength" of a vertex of current indegree k, 
or likelihood of attracting new links. We assume that the attachment rule / has an 
asymptotic slope 

7:= lim 4^ G [0,1). 

Note that, for any r > 0, the profile function ip and attachment rule / together define the 
same model as the profile function '^{■r) and the attachment rule rf. The normalisation 
convention f ip = ^, which will always be assumed for convenience, represents therefore no 
loss of generality. 

In our setup it is not directly the linking probability which is multiplied by the preferential 
attachment factor f{Z^{t—)), but the spatial expansion of the influence profile 

y^^' + ^i /£(!-)] )■ 

around the vertex x = (x, s) at time t. The probability of connecting a new vertex to an old 
one is given by the value of the influence profile around the old vertex at the position of the 
new one. In the important special case of the profile function (p{r) = l{r < which only 
takes the values zero or one, this decision is not random. In this case a vertex x is linked to 
a new vertex born at time t if and only if their positions are within distance f {Zy^{t—)) / {2t) . 
In other words, every vertex x is surrounded by an influence region, a ball of time-dependent 
radius /(Zx(i— ))/(2t), and a new vertex is linked to all older vertices in whose influence 
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region it falls at the time of its birth. This special case already reveals the complexity and 
interest of the model, and the reader is encouraged to first figure out its behaviour. 

The model introduced by Aiello et al. [1] and further studied by Cooper, Frieze and 
Pralat [6], and by Janssen, Pralat and Wilson [12] is essentially the same model for the 
special case that the attachment rule is affine and the profile function is of the form ip{x) = 
pl{x < l/{2p)}. Small differences are that they work in discrete rather than continuous 
time, and allow for spaces more general than Ti, but these differences are inessential for the 
purposes of this paper, see also our comments in Section 6. 

Recall the definition of the asymptotic slope 7 of the attachment function from (4). If 
7 > this means that / is asymptotically linear, and this is known, in non-spatial preferential 
attachment models, to lead to scale- free networks with power law exponent r = 1 + ^• 

We now illustrate the connection between non-spatial preferential and spatial attachment 
models. Suppose the graph Gt- is given, and a vertex is born at time t, but we do not know 
its position, which is therefore uniform on Ti. Then, for each vertex x = (x,s) € Gt~, the 
probability that it is linked to the newborn vertex is equal to 



As a consequence, the process (.^x(i))i>s is a continuous time inhomogeneous Markov chain, 
starting from 0, with integer values, and jumping at time t from state k to state k + 1 with 
intensity 



This quantity is bounded by f{k)/t. As the process (-^x(^))t>s grows of roughly like 
(see Lemma [8] for a precise statement), the normalisation of if makes this bound asymp- 
totically sharp. Hence the jumping intensity of our process is the same as in the classical 
Barabasi- Albert model of preferential attachment O [TB], or its variant studied by Dereich 
and Morters [HI [9] . Not surprisingly, our spatial model exhibits the same limiting indegree 
distribution. 

However, as soon as one deepens the study of the graph further than the first moment 
calculations, the essential difference with the non-spatial models appears. The presence of 
edges is now strongly correlated through the spatial positions of the vertices. These strong 
correlations both make the model much harder to study, and allow the network to enjoy 
interesting clustering properties. These are the main concern of this paper and will be 
described in the next section. We will henceforth use the common notation g = o{h) to 
indicate that g/h converges to zero, g >i h \i g/h \s bounded from zero and infinity, and 
5 ~ /i to indicate that g/h converges to one. 



3.1. Indegree distribution. While the indegree of a given vertex grows indefinitely with 
the size of the network, the mean indegree in the graph Gt converges to a limiting distribution 
with polynomial decay. More precisely, for t > such that Xt is nonempty, denote by 
the law of the indegree of a randomly (and uniformly) chosen vertex in the graph Gt, or 
empirical indegree distribution. More formally, the empirical indegree distribution is the 
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random measure on N, which gives to each /c E N the weight 

if 7^ and f^t{k) = l{k = 0} otherwise. We introduce the probabihty measure ^, 
determined by its weights 

For any measure A on N and any function g: N — )■ [0, oo), we write (A, g) for the expectation 
of g under the law A, or X^fc>o ^{k)g{k). The following theorem states a convergence result 
for the empirical indegree distribution /i^ to the probability measure which we call limiting 
indegree distribution. This result implies, in particular, convergence in probability, in the 
total variation norm. 

Theorem 1. For any nondecreasing function ^r: N — )• [0, oo) satisfying {fJ,,g^) < oo for some 
p > 1, the following limit holds 

in probability, when t — )• oo. 
Remark 1. 

• The convergence in the theorem still holds for any function g, not necessarily positive 
or monotonous, but with g{k) = o{k^) for some 6 < I/7. 

• It is easy to check that, if 7 > 0, the limiting distribution fi satisfies 



which highlights the scale-free property of the network with exponent r = 1 + I/7. 
In the particular case of a linear attachment rule f{k) = 'jk + /3, with 7 G (0, 1) and 
/3 > 0, we have 

'^^ '"Tr(t + te±i)r(iJ, jTii)" ' 

a result that has already been obtained for their variant of the model in Theorem 1.1 
of Aiello et al. [1] by a completely different technique of proof. 

Our result shows that under our normalisation convention, the profile function has no 
influence on the degree distribution. Note however that in the presence of spatial depen- 
dence the normalisation of the profile function typically enforces a significant change to the 
attachment rule. As an example, we look at the case when the vertex y born at time t 
connects to vertex x with probability 



t°d(x,y)" 



A 1 



for a > 1. In our setup, this must correspond to the profile function (p{r) := {-^^ r)~" A 1 
and the attachment rule f'{k) := -^^f^^"{k). Thus, if /^^^ is approximately linear with 
slope 7 the resulting power law exponent is r = 1 + 
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3.2. Outdegree distribution. In the original preferential attachment model of Barabasi 
and Albert, the outdegree is constant. In the model variant of Dereich and Morters, it is 
asymptotically Poisson, therefore it is light-tailed, which implies that it is not relevant in the 
study of the tail of the degree distribution. In our model, the limiting outdegree distribution 
is not Poisson, and we could not find a closed formula defining it. Still, we prove that it is 
light-tailed. 

Denote by vt the empirical outdegree distribution in the graph Gt, defined by its weights 



ii Xt and vt{k) = '^{k = 0} otherwise. The following theorem holds: 

Theorem 2. There exists a probability measure on N such that: 

(1) For any function (7: N — )• M satisfying g{k) = o{e^^) for some < 5 < 1 — 7, we have 

in probability, when t — >• 00. 

(2) The measure v is light-tailed in the following sense: For any < 6 < 1 — j, we have 

z/([/c,-Foo)) = o(e~*^'). 

The limiting outdegree distribution is implicitly defined (see formula (|5.2p below), but 
it is not easy to compute explicitly. Moreover, it is not hard to see from our proofs that the 
indegree and the outdegree of a randomly chosen vertex are asymptotically independent and 
hence the limiting total degree distribution is the convolution fi* v. 

3.3. Clustering. We now define the clustering coefficients for a finite simple graph G = 
{V,E), with unoriented edges, such that at least one vertex has degree two or more. A 
subgraph of G containing exactly three distinct vertices and the three edges linking them 
is called a triangle. A subgraph of the form ({x, y, z}, {{x, y}, {x, z}}) is called an open 
triangle with tip x. In other words, an open triangle with tip x consists of the vertex x and 
two of its neighbours y and z, which could either be connected and hence form a triangle, 
or not. 



The global clustering coefficient of G is defined as 
cgl°b(c) := 3 



glob/^\ o Number of triangles included in G 



Number of open triangles included in G ' 

if there is one open triangle in the graph, and c^^°^{G) = otherwise. Note that always 
c^^°^{G) G [0, 1]. The local clustering coefficient of G at a vertex x with degree at least two 
is defined by 

loc/Q\ Number of triangles included in G containing vertex x 
^ Number of open triangles with tip x included in G ' 

which is also an element of [0, 1]. Finally, the average clustering coefficient is defined as 
if the set V2 of vertices with degree at least two in G is not empty, and c^^{G) = otherwise. 



SPATIAL PREFERENTIAL ATTACHMENT NETWORKS 



7 



Theorem 3. 

(1) Average clustering coefficient: 

There exists a strictly positive number G (0, 1) such that 

in probability, as t ^ oo. 



(2) Global clustering coefficient: 

There exists a nonnegative number cio° € [0,1) such that 

in probability, as t ^ oo. This number is positive if and only if^k'^^{k) < oo. In 
particular, we have 



i/7>i 



Remark 2. Our proofs allow us to write cio° and as multiple integrals over the network 
parameters. Note that the phase transition in the global clustering coefficient occurs when 7 
crosses the value one half, which coincides with the power law exponent crossing the critical 
value r = 3. 

Remark 3. The global and average clustering coefficients have the following probabilistic 
interpretation: 

• Pick a vertex uniformly at random and condition on the event that this vertex has 
degree at least two. Pick two of its neighbours, uniformly at random. Then the 
probability that these two vertices are linked is equal to c^^(G). 

• Pick two edges sharing a vertex, uniformly from all such pairs of edges in the graph. 
Then the probability that the two other vertices bounding the edges are connected 
is equal to cS'°^(G). 

Here is an informal discussion of the clustering phenomenon. For a randomly chosen ver- 
tex, both the number of open triangles with tip in that vertex as well as the number of 
triangles containing it converge to a finite random variable. The ratio of these variables de- 
termines the average clustering coefficient, which therefore is always positive. To understand 
the phase transition in the behaviour of the global clustering coefficient, first note that, as 
the outdegree distribution is always light-tailed, new vertices typically generate a bounded 
number of triangles and hence the number of triangles in the network grows linearly in time. 
If ^ k'^fi{k) < 00 the average number of open triangles per vertex is finite, and so the number 
of open triangles also grows linearly in time, and the global clustering coefficient is positive. 
However, if this sum is infinite, the total number of open triangles has superlinear growth, 
which is enough to guarantee that the global clustering coefficient vanishes. In this case, 
the tip of a randomly chosen open triangle is typically a very old vertex with a high degree. 
This is best seen in the case 7 > ^, in which the degree of the first born vertex at time t is 
of order , so that this vertex alone gives rise to a superlinear number t^"^ of open triangles. 
Observe that these effects match the structure of real networks. For example, if you pick a 
webpage at random, and click on two hyperlinks, it is likely that the two pages you get have 
actually a direct hyperlink. Now, if you pick two webpages which both have an hyperlink to 
the Google homepage, it is not likely that these two pages have a direct link. 
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3.4. Edge length distribution. In the graph G^, we could hope that a typical edge con- 
nects two vertices with birth times of order t and degrees of order one. We would then expect 
from the construction rule (|2.ip that its length is of order 1/t. This description is actually 
always valid within our range of parameters (it would be false for 7 > 1), and explains the 
rescaling below. 

Write E{Gt) for the set of the edges of the graph Gt- Define A, the (rescaled) empirical 
edge length distribution, by 

' ^ (x,y)ei=;(Gt) 

if E[Gt) 0, and = otherwise, where bu is the Dirac measure giving mass one to \u\. 

Theorem 4. There exists a probability distribution A on the real line such that: 

(1) For every continuous and bounded g: [0, 00) — )• M we have 

(At, 5) — > (A,ff), 

in probability, when t — t- 00. 

(2) Suppose that there exists b > 1 such that the profile function satisfies tp{x) x 1 Ax~^. 
Then 

X{[K,+oo)) X lAK-'', 
where rj E (0, 1] is the smallest of the three constants 1, ^ ~ 1 o,f^d (5—1. 
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Figure 1. Level sets for the length exponent r] in the ((5,7) plane consist of 
a rectangular block corresponding to the value rj = 1 and a family of lines 
starting vertically at the (5-axis and turning horizontally upon hitting the 
graph given by (57 = 1. 
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The heavy tails of the empirical edge length distribution highlight the nature of our net- 
works as small worlds. Observe that the distribution A never has a first moment, implying 
that the mean edge length is of larger order than 1/t. As the profile function if is integrable, 
if it decays polynomially, it must be of order for some 6 > 1. If 6 > 2, then the profile 
function does not infiuence the decay rate of the tail of the limiting edge length distribution. 
This stays true if (p is any function satisfying J vip{v) dv < oo. Conversely, a choice of f can 
lead to any exponent within (0, 1] if 7 < 1/2, or within (0, 1/7 — 1] if 7 > 1/2, see Figure 1. 

In Janssen et al. |12j the empirical edge length distribution is studied for the model defined 
in [1]. This is essentially the case of an affine function f{k) = -\- (3 and a profile function 
= p\{x < l/{2p)}, corresponding roughly to the case 6 = 00. They show that if 7 > ^ 
and 1^ < a < 1, then 

I {edges of length longer than I 

for an explicit constant C > 0. Our result uses a different order of limits, but leads to the 
same order of growth for the comparable quantity iA[t^~",oo). If 7 < ^ they show that the 
expected number of edges of length longer than for < a < 1, grows of order which 
is also of the same order as tX[t^~", 00). Note that the general form of the profile functions 
allows for a genuinely richer phenomenology in our case. 



4. Methods of proof 

4.1. The rescaled picture. First of all, it is convenient to describe more explicitly the 
randomness involved in the 'construction rule', which determines the presence or absence of 
each edge in the network. To this end, denote by Ti x (0, 00) the set of potential vertices, 
and by 

^(Ti X (0,00)) := {(y,x),y,x G Ti x (0,oo),y younger than x} 

the set of potential edges. Introduce a family V of independent random variables, independent 
of X, indexed by the set of potential edges, and all uniformly distributed on [0, 1]. We will 
denote these variables by Vx,y or V(x, y). A realization of Xt and V defines a network 
G^(A'f, V), with vertex set Xt, obtained with the same construction as before, but with the 
construction rule replaced by the rule that you connect x to y iff 

where s is the birth time of the younger vertex y. The growing networks {G^{Xt, V))t>o and 
{Gt)t>o have the same law and will be identified. Besides, the deterministic functional 
associates a graph structure to any set of points in Ti x (0, 00) and family of points in [0, 1] 
indexed by E{Ti x (0,oo)) . 

Second, we want to generalize the construction, replacing Ti by Tt = M/(tZ), the one- 
dimensional torus of length t. We authorize the case t = 00, with the convention Too = 
The definition of the set of potential vertices x (0, 00) and the set of potential edges 
E{Tt X (0,00)) is straightforward. We define the functional G*, for t G (0, 00], in analogy 
to the case t = 1, by associating a graph structure to any set of points in x (0, 00), and 
any family of values in [0, 1] indexed by E{Tt x (0, 00)). In the construction, the rule (j4.ip 
is unchanged, but with the new understanding that the distances are now those in Tf. 
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For finite t, we introduce tlie rescaling mapping 

hf. Tix(0,t] ^ Ttx(0,l], 
{x,s) I—)- {tx,s/t) 

which expands the space by a factor t, the time by a factor 1/t. The mapping ht operates 
on the set X, but also on V, with 

ht{V)ht{yi),ht(y) •= ^^,y 

The operation of ht preserves the rule (j4.ip . and it is therefore simple to verify that we have 

G\ht{Xt),ht{V)) = ht{G\Xt,V)) = ht{Gt), 

that is, it is the same to construct the graph and then rescale the picture, or to first rescale 
the picture, then construct the graph on this rescaled picture. Observe also that ht{Xt) is a 
Poisson point process of intensity 1 on x (0, 1], while ht{V) is still an independent family 
of i.i.d. uniform random variables on [0, 1], indexed by E(Tt x (0, 1]). 

Prom now on, we denote hy X a Poisson point process with intensity 1 on M x (0, 1], and V 
an independent family of i.i.d. uniform on [0, 1] random variables, indexed by E{M x (0, 1]). 
For each t > 0, identify (— 1/2, t/2] and T^, and write X* for the restriction of A' to x (0, 1], 
and V* for the restriction of V to the indices in E{Tt x (0, 1]). In the following, we write 
G^{X,V) for G*(A'*,V*), and consider the graph-valued process t i— )■ G'^{X,V). For fixed 
t £ (0,oo), this graph has the same law as ht{Gt)- However, the process behaves very 
differently from the original process t Gt- Indeed, in the original process, the degree of 
any fixed vertex grows like (see Lemma [8|) and thus goes to +00. By contrast, for the 

graphs G^{X,V), we have the following nice convergence property. 

Proposition 5. 

(i) The graph G°°{X,V) defined above is almost surely locally finite, in the sense that 
its vertices all have finite degrees. 

(ii) The graph G^{X,V), almost surely, converges locally to G°^{X,V), in the sense that 
for each x £ X, for large t, the neighbours of x in G*{X,V) and in G°°{X,V) 
coincide. 

As a direct consequence we obtain the following corollary. 

Corollary 6. Almost surely, for any x £ X and each n € N, the neighbourhood of vertex x 
in the graphs G''{X,V) and G°°{X,V) up to graph distance n will coincide for large t. 

The key to the understanding of the drastically different behaviour of the graph-valued 
process t 1— >• G*{X,V) lies in the fact that a fixed vertex in this sequence of graphs has a 
birth time which is comparable to the age of the network. This age would be highly variable 
in time if mapped onto the original graph, but is kept constant in the process 1 1— t- G*{X,V). 

Regardless of the strength of Proposition [5l it only states a local convergence result and 
is therefore insufficient for our purpose. Global results require the introduction of a specific 
law of large numbers, which we state and prove now. 

4.2. A general weak law of large numbers. For xq € R, we introduce the translation 

e^f^: Mx(0,l] ]Rx(0, 1]. 

(x,s) I—)- {x + xo,s) 

The translation O^f^ operates on M x (0, 1], and in a canonical manner also on the point sets in 
Mx (0, 1], and on families indexed by E(M.x (0, 1]). Consider a functional .^oo) which associates 
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a nonnegative real number S,oo{{x, s),Z, W) to a point set -H C M x (0, 1] with a distinguished 
point X = {x,s) £ Z, and a family W of numbers in [0, 1] indexed by E(M x (0, 1]). The 
functional is supposed to be translation invariant, in the sense that 

eoo(x,Z,W) = eoo(^x.o(x),^,.,(^),^,JW)). 

Similarly, for each t > 0, and xq E T^, we introduce the translation 

ei^: Tix(0,l] ^ T<x(0,l], 
{x,s) I-)- {x + xo,s) 

and we consider functionals ^t, which associate a nonnegative real number (,t{{x, s),Z, W) to 
a point set Z cTt x (0, 1] with a distinguished point {x, s) £ Z and a family W of numbers 
in [0, 1] indexed by E{Tt x (0, 1]). The functionals are supposed to be invariant under the 
translations O^.^. 

Finally, for the sake of simplifying the notation, we will also write ,^oo(x, 2^, W) for 
^oo(x, ZU{x.}, W) when the set Z does not contain x, and similarly .^((x, Z, W) for .^((x, ZU 
{x}, W). We also write 

eoo(2, W) := [' s),Z, W) ds, it{Z, W) := C et((0, s),Z, W) ds. 

JO Jo 

Recall the notation of the Poisson point process X and of the family of random variables V, as 

well as their restrictions and V*. In the following theorem, U denotes a random variable, 

uniform on (0, 1], and independent of the point process X and of V. 

Theorem 7 (Weak law of large numbers). Suppose that the following two conditions hold: 

(A) As t ^ oo, the random variable ^^((0, C/), Af*, V*) converges in probability to the 
random variable i^oo((0, ?7), , V); 

(B) For some p > 1 we have the uniform moment condition 

supE[6((0,[/),A'*,V*)P] <oo. 
t>o 

Then, as t ^ oo, we have the following convergence in the -sense, 

i ^ U^,X\V)^nUmU),X,V)]=E[C^{X,V)]. (4.2) 

Remark 4. 

(i) Theorem [7] is an adaptation of Theorem 2.1 of Penrose and Yukich |T7] to our pur- 
pose. Their result also includes a de-Poissonization, but this is incompatible with 
our set-up because of the explicit time dependence of the attachment probabilities. 

(ii) Suppose now that only condition {A) is satisfied. On the one hand, the proof still 
works if the family (^t((0, U),X'', V*))t>o is uniformly integrable. On the other hand, 
if E[^oo(A'iV)] = oo, then, by applying the theorem to the bounded functional 
^f{-x,Z,W) := ^i(x, Z,>V) A k and letting k go to oo, we get the convergence in 
probability of 

1 J;6(x,A'^v*) 

to -|-oo. The only case when the theorem does not yield any convergence result is 
when E[^oo(A', V)] is finite but the family (^t((0, C/), Af*, V*))t>o fails to be uniformly 
integrable. 
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Proof. As in Theorem 2.1 in [T7] the proof rehes on a first moment calculation, then a 
second moment calculation which is performed under a stronger uniform moment condition, 
and finally a step to allow the removal of this extra condition. 

First moment: We compute 



ds E[et((0, s), X'y)] = E [^t{{0, U),X\V')] . 

Jo 



Note that in all these expressions but the first one, a point is added to X*. The second equal- 
ity follows from the spatial invariance by the translation 9Lx, both of the functional and 
of the law of (Af*, V*). Now condition (A) states that the variables ^f((0, U),X* , V*) converge 
in probability to ^oo((0, U),X, V). Condition (B) ensures that they are uniformly integrable. 
Therefore we have convergence of the expectations E[,^(((0, [/), Af*, V*)] to E[^oo((0, U),X, V)], 
and this expectation is finite. 

Second moment: We work here under the stronger assumption that the uniform moment 
condition holds for some p > 2. Similarly as in the case of the first moment, we get 



E 



= e[1 J;6(x,^*,V*)']+Ie[^ E 6(x,^*,V*)6(x',^*,V*) 



+ E[^t{{tXi,Ui),X' U {{tX2, C/2)}, V*) mtX2, U2),X' U {(iXi, C/i)}, V*)] , 

with Xi and X2 uniform in Ti, Ui and U2 uniform in (0, 1), and X , Xi, Ui, X2, U2 inde- 
pendent. The first term goes to zero, thanks to the uniform moment condition with p > 2 
[p = 2 would be enough). 

Now, the second term is the expectation of the following product of random variables, 

6((o, c/i), eU^^ {X') u mx2 - Xi), U2)}, qUx, (V*)) 

X 6((o, ^2), u - X2), f/i)}, 0i,.Y,(V*)), 

whose behaviour we have to understand. We first concentrate on the first term. Write 

X' := eUxM') U {(*(^2 - ^1), f/2)}, V* := eiix,(V*). 

We introduce three events, Et := {td{Xi,X2) > Vi}, Ft := {td{Xi,l/2) > Vi/2}, and Gt 
the event that the Poisson point process X has at least one point in {{x,s): d{x,0) > y/i}. 
These are all asymptotically almost sure (a.a.s.), in the sense that their probability goes to 
one when t — )• 00. We make two important observations: 

• On the event Ft, the restrictions to x (0, 1] of the sets X^ and 0-tXi (X) coincide. 
Similarly, the restrictions to x (0, 1] of the families V* and 9^tXi (V) also coincide. 

• The law of (A'*, V*) knowing Et equals the law of (A'*, V*) knowing Gt- 
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These observations allow the following calculation, with rj some positive real number. Note 
that we will apply now (and until the end of this proof) the functional to point sets on 
M X (0, 1] or Tt X (0, 1] (and families indexed by E{R x (0, 1]) or ^(T* x (0, 1]) ). This is only 
to lighten the notations a bit. It should always be understood that the functional is applied 
to the restrictions on x (0, 1]. 

hmsupPj U{0, Ui),X', V*) - C^((0, C/i), 9.tx, W,0-tx, (V))| > r?} 

= limsupE[ll{|et((0, Ui), X\V') - ^^((0, f/i), O.^x, {X),e_tx, (V))| > rj}tF,\Et] 

t—^co 

= limsupE[]l{|et((0,C/i),^*,V*)-C^((0,C/i),^*,V*)| >77}]li.Ji?j] 
= limsupE[l{M{0,Ui),X'y)-C^^{{0,Ui),X'y)\>vnFpt] 

t—>oo 

= lhnsupF{M{0,Ui),X\V')-C^^{{0,Ui),X'y)\>v}=0. 
The last equality uses condition (A). Hence, the variable 

6((o, Ui),x\ V*) - e^((o, Ui), e^txA^), o^tx, (v)) 

converges in probability to zero. Similarly, one can see that the variable 

6((0, [/2), U {(^(Xi - X2), C/i)}, 0L,^,(V*)) - ((0, C/2), 0_tx.(A'), 0iiX.(V)) 

converges in probability to zero. Next, observe that the two variables 

C^,iio,Ui),e^tx,{x),e^txAy}) 

and 

C^,{{o,U2),e.tx,{x),9^tx,m 

are independent conditionally on the event Ef. Moreover, observe that the law of each one 
converges to that of Coo((0, Ui), X, V), thanks to condition (A) again. Gathering the results, 
we get that the product in (j4.3p converges in law to the product of two independent copies 
ofC^{{0,Ui),X,V). 

Finally, use Cauchy-Schwarz to get a uniform moment condition for this product for | > 1. 
Hence the expectation of the product goes to E[^oo((0, Ui), X, V)]^. Therefore we get ()4.2p . 
with convergence even in L^. 

Relaxing the moment condition: We finally work under the assumptions of the theorem, 
that is, the uniform moment condition is satisfied only for some p > 1. Introduce the bounded 
functional 

cH^,2,w) ■.= Ct{^,z,w)Ak. 

This functional clearly satisfies condition (A) and the uniform moment condition for any p, 
in particular for some p > 2. Therefore, we get the convergence of 

to E[^oo((0, Ui),X,V) A k], in L^, and thus in Now note that 

^il E m^,s),x'y)-c^i{x,s),x\v')) 

= E [^i((o, u),x'y) - Ct'((o, u),x'y)] , 
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which is nonnegative and goes uniformly to zero for k — )• oo, as the variables '^t((0, f/), Af*, V*) 
are uniformly integrable, by the uniform moment condition. It follows that 

i ^ 6(x,Af*,V*) 

converges in to the limit of E[^oo((0, U),Xy) A fc], that is E[^oo((0, U),X,V)]. □ 

4.3. A bound on the indegree and on the linking probability. As we consider various 
graphs on various spaces, we need to introduce more flexible notation for the degrees. If G 
is a graph with vertices in Tt x (0, oo) we write x f-?- y to indicate that there is an edge 
between the vertices x and y. If xq = (rco, sq) is in G, then, for any s > sq, we define 

Zx(,(s,G) = \[{x,r) £ G: (x,r) o {xq,sq),sq < r < s}|, 

the indegree of xq in G 'at time s\ and 

Y^^{G) = |{(x,r) G G: (x,r) o {xQ,SQ),r < so}\, 

its outdegree. For t £ (0, oo] and for < sq < s < 1, we write 

4o(^) = ^xo(5,G*(A'U{xo},V)) and = Y^,iG\X U {xo},V)). 

For fixed t and xq, call (Z^^(s))sg<s<i the indegree process. In this part only, we extend 
the Poisson point process X on the whole M x (0, oo), and allow any < sq < s in the 
definition of ^^^(s). For xq G Ti x (0, oo), the process (■^xo('^))s>so same law as the 

process (Zxo(s))s>s(, introduced earlier in Section 2, so that the results of this part apply 
simultaneously for the rescaled graphs and for the unrescaled ones. Now, observe that the 
law of the indegree process does not depend on the spatial position xq £ Tt- Therefore, we 
simply write Zl^{s) for Z'*q^^^(s) and Y^^ for ^(qsq)' ^ vertices in X, we 

write X o y for the event that x and y are linked in G''{X, V). 

Lemma 8. For all t > and xq G T^, we have almost surely 

In (s) 7 In s as s ^ oo. 

This lemma confirms that the degree of a fixed vertex in the unrescaled graphs grows 
polynomially with parameter 7, and in particular that it explodes. Before proving it we give 
a bound on the probability that a vertex reaches an exceptionally high degree, allowing it 
to be connected to an exceptionally distant vertex. Exponential bounds, uniform in t, are 
provided in the following lemma and its corollary. For the sake of simplicity, they are only 
stated in the case of a linear function /. We refer to Remark [5] for the general case. 

Lemma 9. Suppose f{k) = + (3, with 7 G (0, 1) and /3 > 0. Let p = — 1], so that 

f{k) < 7(/c + p + 1). For any t G (0,oo], any sq < s < 1 and any k > 0, the following 
inequality holds: 

F{zl^{s)>k}<eU^p{-l{^y). (4.4) 

Corollary 10. Define the inverse of the profile function ip by 

ip~^{u) := inf{x > 0: ip{x) < u}. 

There is a constant c depending only on f, such that for any t G (0,oo] and any {x,s) G 
M X (0, 1] , we have 

P{(0,1) o (x,5) I (0,l)GA',(x,s)G;f,V(o,i),(.,.) = n}<cexp(-s:^g^). (4.5) 
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Remark 5. In the nonlinear case, we can first bound / from above by a linear function, 
then, by an easy stochastic domination argument, get the inequalities of the lemma and its 
corollary with the linear bound instead of /. We get almost equally good bounds. More 
precisely, for any 7' > 7, we can find /3' > such that f{k) < ^'k + [5' for any natural number 
fc, and we thus get bounds for any exponent 7' > 7. 

A first corollary of Lemma [9] is that the indegree (s) is always almost surely finite, 
even when t = 00. The fact that the same holds for the outdegree is established later in 
Proposition [13l 

At this stage, let us discuss the important monotonicity property. If we fix sq and s and 
let t grow to +00, then Zl^{s) will grow and converge to Z^{s). Moreover, if we change the 
position of the vertex to be non-zero, we do not change the law of its indegree and therefore 
its indegree will still be stochastically increasing in t and stochastically dominated by Z^{s). 
By contrast, no such property holds for the outdegree. Indeed, increasing t may increase 
the distance of two vertices near opposite ends of the boundary of [— f , f], thus decreasing 
the indegree of the younger vertex which, in turn, might destroy further links, eventually 
reducing the outdegree of the vertex at the origin. 

Proof of LemmalEi We fix sq > and start with the case t = 00. The indegree process 
is an inhomogeneous continuous time Markov chain, starting from Z^{so) = 0, 
and for which, at time s, the transition density from state k to state A; + 1 is f{k)/s. Indeed, 
given Zsq{s) = k, we have Z^{s + ds) > + 1 if and only if the set 

{y,u)eX:ue{s,s + ds], V((0, 1), (y, u)) < ip [^^) } 

is nonempty, which happens with probability ds + o{ds). We introduce a logarithmic 
change of time and write 

Z(n):=Z-(soe-). 

Then the process Z is an homogeneous Markov chain, with jumping intensity from state k to 
state k + 1 equal to f{k). Write := inf{u: Z[u) > /c} for the first time when this Markov 
chain hits state k. Then the (Tj+i — Tj)j>o are independent and for each i, the variable 
Tj+i — Ti is exponential with parameter f{i). The process 

fc-i ^ 
Mk ■.= Tk-Y^ —— 



j=0 



is a martingale, which is bounded in L? and thus convergent. Hence, we have T^. ~ ^In/c, 
and further 

lnZ(u)~7n and InZ^(s) ~ 7lns. 
For the case of a finite t, we first get, from the monotonicity property, the bound 

limsup — < 7. 

s— !>oo In s 

The process (■^sg(soc"))s>so is an inhomogeneous continuous time Markov chain with tran- 
sition density from state k to state + 1 equal to 

at 



2/(fe) ['"'\{y)dy, 
Jo 



1+7 



which is equivalent to f{k), uniformly in k < s 2 ^ when t goes to +00. The same arguments 
as in the case t = 00 then show that we still have In Zl^{s) ~ 7 Ins. □ 
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Proof of Lemma [3 By the monotonicity argument we can suppose t = oo and, as before, 
we study the chain Z and its hitting times T^. The parameter of the exponential variable 
Tj+i — Ti is /(«), which is less than or equal to (p + i + l)^. It follows that is stochastically 
dominated by a sum of independent exponential random variables with parameters (p + 1)7, 
{p + 2)7, . . . , (p + /c)7, respectively. 

Let (Tj)i<j<fc_|_p be a family of i.i.d random variables, each following an exponential law 
with the same parameter 7. Let (rij , rVj, Tj^.^^) denote their decreasing rearrangement, 
and Tj^_^p^-^ =0. Foi 1 < j < k + p, let Tj = n. - n.^^. Then the family {Tj)i<j<k+p is 
independent and Tj is an exponential variable with parameter J7. Observe also that 

Tp+l + ... + Tp+fc = Tjp^-^. 

Hence, 

P{Z-(5oe")>fc}<P{x,^^, <n}. 

Now write 

k+p 

i=i 

The sum of indicators follows a binomial law of parameters k+p and exp(— 7u). Recall the 
concentration inequality for binomial random variables X, 

P{X<E|X1-A}<exp(-^)^ 

We apply this with A = ^{k +p) exp(— 7ti) and get 

k+p 

P{ t{Tj > < p} < exp ( - |e-^")l{2p < fce"^''} + t{2p > ke-^""] 

<exp(p/4)exp(-|e-^"). 
Finally, gathering the results, and taking n = Ins — Inso gives, for any /c > 0, 

P{Z-(s) > A:} < exp(p/4)exp (-| (^)^) , 
as required. □ 

Proof of Corollary 1 1 01 The event (0,1) -H- x coincides with the event that the indegree of 
vertex x at time one is large enough to ensure that the linking condition is satisfied. This 
indegree has the same law as Zl{l) and is independent of V((0, l),x). We thus get 

(0,l)<^x I (0,1) G A',xG A',V((0,l),x) =n} <p{(^(j(ifL_) > n} 



<P{^*(l)>r^(^)}<eiexp(-f 
<e4 8.exp(- g-Li ), 



1^1 _ §_ 



p , J_ 

yielding (|4.5p with the explicit constant c = e-* 87 . □ 

5. Specific proofs of the main results 

All the proofs of this section rely on the application of Theorem [7] to appropriate func- 
tionals. The functionals we use are only defined and used within each subsection. That is, 
the same notation in different subsections indicates different functionals. 
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5.1. Empirical indegree distribution. The following lemma provides the expected inde- 
gree of a vertex in the infinite graph with age uniform on (0, 1]. 

Lemma 11. Let U be uniformly distributed in (0, 1] and independent of the point process X. 
Then, for any k > 0, we have 

F{Z^{l) = k} = fi{k), 
where fi is the probability measure defined by 

Proof. Recall that the process (■^^(soc"))o<u<ln(i/so) ^ homogeneous Markov chain with 
transition intensity from state k to state k + 1 equal to f{k). Consider also the Markov chain 
{Zu)o<u<\n{i/so) with values in [so,l] x N started in Zq = (sqjO), such that at time u the 
jumping intensity from state {s,k) to state {s,k + 1) equals f{k), and from state {s,k) to 
state (soe",0) equals one. 

The following facts are easy to check: 

(1) The first coordinate Z^^-^^^^^^ of the chain {Zu)o<u<\n{i/so) time ln(l/so) is equal 
to So with probability sq and otherwise uniformly distributed on the interval [sq, 1]. 

(2) Conditionally on Z^^^^^^^^ = si, the second coordinate Z^^^^^^^^ has the same law as 
the random variable Z^{1). 

(3) The second coordinate (^u)o<u<in(i/so) is a homogeneous Markov chain, jumping 
from k to k + 1 with intensity f{k), and from k to zero with intensity one. 

The Markov chain stated in the third point was already encountered in [7]. It is recurrent 
and its law converges to its invariant measure, which is precisely fi. From the first two 
points, we deduce that the law of Z^^^^^^^ conditional on Z^^-^^^^ ^ sq is the same as the law 
of Z^{t), where U is uniform on [sq, !]• Now, letting sq go to zero gives the result. □ 

Proof of Theorem [II Let g( be a nondecreasing functional satisfying (^u, g^) < oo for some p > 
1. We win apply Theorem [7] with the functionals ^j(x,2:,W) := 5'(Zx(l, G*(Z U {x}, W))) , 
t E (0,oo], so that for x G X\ we have ^t(x,Af, V) = g{Z^{l)). 

First, observe that the expectation of S,oc{X, V) is {fi, g). Second, observe the following two 
simple consequences of the monotonicity property. The process (Z^(l))t>o is nondecreasing 
and converges almost surely to Z^'{1), which is finite almost surely. Moreover, the following 
uniform moment condition is satisfied, 

supE[6((0,C/),A',vn < E[Coo ((0,C/), A', ] = <oo. 
Hence, Theorem [7] ensures the convergence 

i ^5(Z*(1)) (^,5), 

in and thus in probability. Combining this with the well-known convergence \X^\/t ^ 1 
gives the convergence in probability 

and thus proves Theorem [TJ □ 
We close this subsection with a lemma which implies Proposition [5||i) . 
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Lemma 12. Almost surely, for any x = {x,s) E X, the incoming edges of x in G^{X,V) 
and in V) are a finite number and coincide for large t. 

Remark 6. The monotonicity property implies that the indegree of a vertex x in G^{X, V) 
converges almost surely to that in G°°{X,V) if the position of the vertex is zero, or in 
probability if its position is nonzero. The lemma guarantees that there is actually always 
almost sure convergence. 

Proof. We work conditionally on x = (x, s) £ X, and start by showing that there exists an 
almost surely finite random variable M such that, for all t G (0, oo] and y G X younger than 
x and at distance at least M of x, the vertices x and y are not linked in G^{X,V). 

The strategy is to find a coupling with a model independent of t, based on the observation 
that the distance between x and y in Tj can be shortened by at most 2\x\ compared to that 
in M. Let K be the number of vertices in X located at distance at most 2\x\ of x, which is 
an almost surely finite random variable. Consider the model where: 

• the vertices within distance 2\x\ of x are deleted, 

• the other vertices all come closer to x by distance 2\x\, 

• the attachment rule / is replaced by the rule fx - i ^ f{i + K). 

It should be clear that the vertices y G X younger than x, at distance at least 2\x\ of x, 
which are linked to x in some finite graph G*(Af,V), are also linked to x in this model. 
Furthermore, the indegree of x is still finite almost surely. Hence it suffices to choose M as 
the distance of x to the furthest younger vertex it is linked to in this model, plus 2\x\. 

Finally, all that is left to show is that the incoming edges of x linking it to a younger 
vertex y within distance M coincide in G*(Af,V) and in G°°(Af, V), for large t. This follows 
from the following two simple observations. First, the vertex x is linked to no other younger 
vertex beyond distance M - in G°°(Af,V) or in any G*(Af,V) - which could influence its 
indegree. Second, for t > \x\ + M, the vertices in X and in Af* within distance M of x 
coincide. Hence, for t > |x| + M, the vertex x has the same incoming edges in G^{X, V) and 
mG'^{X,V). □ 

5.2. Empirical outdegree distribution. The following proposition describes what we 
know about the expected outdegree distribution in the infinite picture. 

Proposition 13. For any u £ (0,1], the expected outdegree distribution, defined by the 
weights 

u{k) :=F{Y;^ = k}, A;GNU{oo}, (5.2) 
is independent of u. Moreover, the measure u is a probability measure on N (i.e. v{co) = 0) 
and it is light tailed in the sense that for any (5 G (0, 1 — 7), we have 

v{[k,oo)) = o{e~^^). 

Proof. The fact that v{k) does not depend on n is a simple consequence of the rescaling 
invariance property. Therefore we only consider u = 1 , and we watch for the law of , the 
outdegree of the point (0, 1) in the infinite picture. 

Attach to each vertex x G the value Vx := V((0, l),x). Then each vertex can be 
identified with a point of M x (0, 1] x (0, 1), and the set of vertices becomes a Poisson point 
process of intensity one on M x (0, 1] x (0, 1). The idea is to define a domain Ej^ such that 
the probability that there is any vertex in E}^ linked to (0, 1) is 0{e~^ ), and the probability 
that there are in total at least k vertices in the complement of -E^ (not necessarily linked to 
0) is also 0{e-^ ). This goes as follows: 
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Fix 6 £ (0, 1 — 7). Choose first 7' G [7, 1 — S) such that Inequahty (|4.5p is satisfied for 
some constant c G (0, 00) (this is always possible, in the linear case even with 7' = 7, 
see Corollary [TO] and Remark [5]). Then, choose 61, 82 such that 6 < 62 < Si < 1 — ^' . 
Introduce 



}■ 



= {(x, 5, n) G M X (0, 1] X (0, 1) : > /c^2 , s > 

Then, from Corollary 1 101 for any x = (x, s) and u such that (x, s, u) G -Bfc, we have 
^>-x|xGA',Vy = u|'<c exp ( — o-T.-i I < c exp \ —-^ \ . JT'.a 1 



P{(0, 1) X I X G Af,Vx = ti} < cexp 
Therefore, we get 



E 



\{{x,s,u) G Ek-. {x,s) G , Vx = ti, (0, 1) (2;,s)}| 



< 



< 



dx ds dn c exp 



\x\ 



87' \ 



K iX> ^ (u) J 



< 2 / (/?~^(n)dti 



dx du c exp 



1 / l^i \^'\ 

{u)) ) 



87' 



s_ ce:sip{- -^y^^) dy, 



[fc*2 ,00) 

with the change of yariable y = \x\/if~^{u). The first integral is equal to the integral of (p on 
[0,00), that is 1/2. For the second integral, introduce an appropriate constant Ci and get 

/ , cexp(-gi./^)dy< / , C7i^/^-^exp(-g^/^)dy<C7iexp(-A:''). 



The right hand side is a bound to the expected number of vertices in linked to (0, 1), and 
thus it is also a bound to the probability that there is any vertex in E^ linked to (0, 1). 

Now, with an easier calculation we get that the total Lebesgue measure of the complement 
of Ek is bounded by 



X (0,1] X (0,1) 



dxdnds (]l{y < A;*2} + l{s < y ~^}) 



< 2 



ip-^{u)du {Hy < k^2} + (I Ay) ~^)dy 

i(0,oo) 



and is therefore less than A;*2 plus a constant C2. As the total number of points of X in this 

_s_ 

domain is a Poisson variable of parameter less than fc*2 -\- C2, we have 

P| {{x,s,u) i Ek-. (x,s) G ^,Vx = u,{0,l) <-> (x,s)} > A;| 

with C3 another constant (we used Stirling formula at the second line) . Finally, the probabil- 
ity that the outdegree of (0, 1) is greater than or equal to k is less than (Ci + C3) exp(— A;"^). 
Hence i/([A;, 00)) = 0(exp(— A;'')), as claimed. □ 



20 



EMMANUEL JACOB AND PETER MORTERS 



The same proof, with the sets and their complements replaced by their restrictions to 
X G {-t/2,t/2] also yields 



with the same constants Ci and C3 for any u and t. Hence, the variables are stochastically 
dominated by a light-tailed random variable (this variable may not be , recall that Y^ is 
not monotone in t). 

Take g a function satisfying g{k) = 0(exp(A;'^)) for some 5 < 1 — 7, and define, 



for t G (0,00], so that ^f(x, Af,V) = giY^). The domination (j5.3p provides the uniform mo- 
ment condition (for any given p > 1). Theorem [2] follows, provided we prove the convergence 
in probability of ^t((0, n), A', V) to ^oo{{0,u), X ,V), for any u £ (0, 1]. The following lemma 
proves more, and also completes the proof of Proposition [5j 

Lemma 14. Almost surely, for any x = (x, s) G X, the outgoing edges ofx in G^{X,V) and 
in G°^{X, V) are finite in number and coincide for large t. 

Proof. Again, we suppose without loss of generality s = 1 and work conditionally on x = 
(x, 1) G X. Observe that if M is any finite number then, almost surely, all the indegrees of 
vertices in the graph G^{X, V) with spatial position in [x — M,x + M] go to the corresponding 
indegrees in G°°{X,V). Therefore, almost surely, the outgoing edges linking x to a vertex 
within distance M of x coincide in G^{X, V) and in G°°{X, V), for large t. The latter remains 
true if M is random, but finite almost surely. The lemma then follows if we show that there 
exists an almost surely finite random variable M such that for all t G (0, 00], for each x' G <^ 
at distance at least M of x, the vertices x and x' are not linked in G''{X,V). 

To prove this, we use again the coupled model introduced in the proof of Lemma [T2j 
Again, the vertices linked to x in some finite graph G*{X,V) are also linked to x in the 
coupled model. Furthermore, in the coupled model, it is clear that the outdegree of x is still 
finite almost surely, and we can simply choose M to be the distance of x to the furthest 
vertex it is linked to in this model, plus 2\x\. □ 

5.3. Clustering. 

5.3.1. Average clustering coefficient. In this part, consider, for t G (0, cxd], the functional ^£ 
and ^1 defined by 



with the convention ^^(x, Z,yV) = if x ^ V2 (G'*(2 U {x}, W)) , that is if x has degree 
less than two. Thanks to Proposition [5] and its corollary, we know that for any x, there is 
almost sure convergence of ^t(x, Af, V) to ^oo(x, A',V), and of ^^(x, A", V) to ^^(x, ^, V). In 
particular, condition [A) of Theorem [7] is satisfied for both functionals. Moreover, as they 
take values in [0,1], the uniform moment condition (B) is also satisfied. We immediately 
deduce the convergence in and in probability of 



nyu>k}<{Ci + C3)eM-k^) 



(5.3) 



ej(x, Z, W) := g{Y^{G\Z U {x}, W))) 



Ct(x,2:, 

^I(x,2, 



w) 



4°=(G*(Zu{x},W)), 

]i{xGy2(G*(zu{x},w))} 




and M 
t 
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to the constants E[Coo{{0,U), X ,V)] and F{{0,U) G V2 (G*(Af U {(0, C/)}, V)) }, respectively. 
Hence the average clustering coefficient converges in probability to 

C := K [urn U), X, V) I (0, U) G V2 {G\X U {(0, U)}, V))] . 

This constant is the expected local clustering coefficient of the infinite graph at vertex (0, U), 
conditionally on the event that its degree is at least two. It is hard to compute analytically, 
but it clearly belongs to (0, 1). The first part of Theorem [3] is proved. 

5.3.2. Global clustering coefficient. The estimation of the global clustering coefficient relies 
on separate estimations of the number of triangles and of the number of open triangles in 
the network. We choose to count the triangles from their youngest vertex, and define the 
functional i^t(x, Z, W) to be the number of triangles in G^{ZL){x}, W) having x as youngest 
vertex. Again, Proposition [5] ensures that condition (^4) is satisfied. The simple observation 
that ^((x, Af,V) is bounded from above by Y^{Y^ — l)/2, together with Inequality (|5.3p . 
ensures that the uniform moment condition (B) is satisfied for any p > 1, and we can apply 
Theorem [71 The number of triangles in the network V), divided by t, converges to 

a positive and finite constant. In other words, the number of triangles is asymptotically 
proportional to the number of vertices. 

Similarly, we introduce the functionals 

e;(x,Af,V) = ^iil^^iil^ and (x, A', V) = r^Z^W + ^^^^^ " 

where ^[ corresponds to the open triangles whose tip x is the oldest vertex, and (^'Z are the 
remaining open triangles with tip in x. For both functionals, Condition (A) follows again 
from Proposition [5l Condition (B) for functional is also automatically satisfied, for any 
1 < p < ^. More precisely, to bound the expectation of the product (Yf^Z^(l))P, first use 
their independence conditionally onU = u, then use the domination (15. 3p to bound uniformly 
E[(y^)^'], before integrating with respect to u. Therefore the number of open triangles whose 
tip is not the oldest vertex, divided by t, converges in probability to a positive and finite 
constant. 

It is only for the functional that we must discuss different cases. Suppose first ^ k'^fi{k) = 
00, which implies E[^^(A',V)] = 00. In that case. Theorem [7] and Remark H] imply that 
the number of open triangles with tip the oldest vertex, divided by t, goes to +00 in prob- 
ability. Hence, the global clustering coefficient converges in probability to zero. Finally, 
suppose ^A;^/i(A;) < 00 and hence E[^^(Af,V)] < 00. The monotonicity property implies 
that the variables ^^((0, [/), Af*, V*) are always uniformly integrable, even when Condition (B) 
is not satisfieci^, and allows to conclude that the global clustering coefficient converges in 
probability to a positive constant. 

5.4. Empirical edge length distribution. The law of the distribution Xt, the resettled 
empirical edge length distribution in the original graph Gt, is the same as the law of the 
unrescaled empirical edge length distribution in the graph G*(^,V), which we will denote 
by Xt- We have, abbreviating := V*)) and assuming it is not empty, 

(x',x)e-B* xeA"* xeA"* x'ga-«,x'«x 

older than x 

"'^If 7 < |, then (B) holds for any 1 < p < but if 7 = | and Y2 k^f-i^{k) < 00, then (B) does not hold. 
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where we have chosen to count each edge from its younger vertex. Define the probabihty 
measure A on (0, +00) by 

X{A)= E[\{{x,s)€X: {x,s)^{0,U),s<U,\x\£A}\], 

for any Borel set A C [0, 00), where U denotes a random variable uniformly distributed on 
(0, 1) and independent of X and V. By application of Theorem[7|we get, for any x G [0, 00), 

Xt{[x,oo)) — > A([x,oo)), 

in probability. Hence there is convergence in probability of A* to A in the space of probability 
measures with the weak topology. This proves the first part of Theorem HI 

Next we estimate the order of X{[K, 00)) when K is large. Fix K > 0. We have 

X([K,oo)) =2 1 dx(^dt®du®ds¥{ix,s) o (0,t) I V((x, s), (0, t)) =u\, 
Jn 00 I 

where O is the domain \^{x,t,u, s) G [K,oo) x (0,1)"^: s < t}. The factor two comes from 
the fact that we have chosen x > 0. The linking probability contains an implicit conditioning 
on the event that {x,s) and (0,t) are in X. As in the proof of Corollary [10] we can rewrite 

¥{{x,s)^{0,t)\V{{x,s),{0,t))=u} = F{Z^{t)>r\tx/ip-Hu))} 

= ¥{Zf^,{l)>r'{tx/^~\u))}, 

where /"^ is the right-continuous inverse of /. Changing the variable 

(x, t, u, s) ^ {y, z, u, r) with y = ^^r^, z = ^^^^^^,r = f , 

sending Q to $7' = {{y,z,u,r) G (0,oo)'^ x (0,1), z < y, u< (p{z)} we get 



A([K,oo)) = 2K-i / dyy( [ dz [ Au ^-\u))( f dr P{Z~(1) > /"Hy)}) 

POO 

= 2K-iy^ dyyl{f)Jiy), 

with / and J defined to be the two integrals in brackets in the first line. For an estimate 
of J, we simply note that J{y) = iJ,{\f^^{y)], 00)) x lAy~^/^. For an estimate of I we start 
with the equality 

/ du(p~^{u)= / (p{z\/v)dv, 

J{0,<p{z)) 7(0,00) 

based on the observation that they both represent the area of 

{{u, v) G (0, 00)^ : u < f{z),v < (f{z)], 

to get 

1(a) = / dz dv (p{z V v) = 2 / v(p{v)dv + a / (p{v)dv. 

J{0,a)x{0,oo) J{0,a) J (a,oo) 
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Now, elementary calculations yield 

{aAl if / Vip{v)dv<oo, 

Jo 
aAa^-^ if 99(1;) X 1 Av-"^ for 5 G (1,2]. 

Finally, another elementary calculation shows that we have 

X{[K, 00)) X 1 A {K~^ + K^~^ + K^^^), 

and Theorem U follows. 

6. Variants of the model 

6.1. Discrete versus continuous time. We have decided to define our model in continuous 
time, as this is naturally aligned with our techniques of proof. We expect that all our results 
hold without change for the analogous discrete model, but we have not attempted to derive 
this from our results as we do not expect to get interesting insights from this. We point 
out that the weak law of large numbers in [l7j includes a de-Poissonization, but this cannot 
be applied directly in our case as it does not deal with the explicit time dependence of the 
attachment probabilities. 

6.2. The case 7 > 1. This assumption leads to a very different behaviour, which we briefly 
discuss. Lemma 8 does not hold anymore. Instead, the indegree of a fixed vertex (the oldest 
one, for example), grows roughly linearly, and it will be eventually connected to a positive 
proportion of the younger vertices. The length of its incoming edges is thus of order one. 
The law of large numbers. Theorem 7, holds unchanged, as well as Theorem 1. That said, 
we have Yl kfj,{k) = 00, which implies that the total number of edges is superlinear. The 
empirical outdegree distribution converges vaguely to the null distribution, as all the mass 
escapes to infinity. In the infinite picture, the outdegree of each vertex is almost surely 
infinite. Finally, the same phenomenon happens to the empirical edge length distribution, if 
we still rescale it by the same factor of t. 

6.3. Higher dimensional space. We have chosen to present our results for spatial distri- 
butions given as uniform distributions on the one-dimensional torus to keep technicalities to 
a minimum. There is no problem generalising results to higher-dimensional tori T"^, in fact 
if we connect the vertex y = (y, t) to an older vertex x with probability 

/ tV'^ci(x,y) \ 

and normalise the profile function so that 

/ ip{d{0,y))dy = l, 

we can recover Theorem 1,2 and 3 verbatim by the same arguments. In the empirical edge 
length distribution we need to rescale by a factor of ta instead of t, and we obtain a limiting 
edge length distribution A, which depends on the dimension. If the profile function scales 
like f{x) X 1 A we need to have 6 > d to meet the integrability condition. Then we 
recover Theorem 4 with r] G (0, 2d — 1] the smallest of the three constants 2d — 1, 6 — 1 and 
d{- — 1). If 77 > 1 then A has a first moment, and the mean edge length is of order t~d. 
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6.4. More general underlying spaces. It is no problem to define our model in a general 
metric space. However this can lead to a significant change in the behaviour, as inhomo- 
geneities in the underlying space introduce an element of fitness of individual vertices. In a 
similar spirit one can change the spatial distribution of incoming vertices. Again one would 
expect that small changes do not change the qualitative behaviour, whereas highly fluctu- 
ating densities can have a major effect. These problems have recently been discussed by 
Jordan [H] for a closely related model. 

6.5. Further remarks and problems. Our technique allows the analysis of a wide range 
of functionals of spatial preferential attachment networks, and we have only picked those 
that appeared most interesting to us at this point. Other network 'metrics' that could be 
studied are the total edge length, the number of occurrences of a particular finite subgraph 
(or motif), or the number of (suitably defined) high density spots. 

More generally, the local limit results established here offers a handle to the study of global 
connectivity problems, for example the existence and diameter of a giant component. This 
would be of particular interest as, to the best of our knowledge, nontrivial rigorous results 
on the existence of the giant component have never been established for dynamic network 
models that are not locally tree-like. A first discussion including a conjecture for the location 
of a phase transition related to the existence of a giant component in the model of [1] can 
be found in [6]. 
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